163 research outputs found
Modeling Phishing Decision using Instance Based Learning and Natural Language Processing
Phishing is the practice of deceiving humans into disclosing sensitive information or inappropriately granting access to a secure system. Unfortunately, there is a severe lack of theoretical models to adequately explain and predict the cognitive dynamics underlying end-user susceptibility to phishing emails. This paper reports findings from an Instance-Based Learning (IBL) model developed to predict human response to emails obtained from a laboratory experiment. Particularly, this work investigates the effectiveness of using established natural language processing methods, such as LSA, GloVe, and BERT, to represent email text within IBL models. We found that using representations that consider contextual meanings assigned by humans could enable IBL agents to predict human response with high accuracy (>80%). In addition, we found that traditional NLP methods that capture semantic meanings in natural language may not be effective at representing how people may encode and recall email messages. We discuss the implications of these findings
Graph-Segmenter: Graph Transformer with Boundary-aware Attention for Semantic Segmentation
The transformer-based semantic segmentation approaches, which divide the
image into different regions by sliding windows and model the relation inside
each window, have achieved outstanding success. However, since the relation
modeling between windows was not the primary emphasis of previous work, it was
not fully utilized. To address this issue, we propose a Graph-Segmenter,
including a Graph Transformer and a Boundary-aware Attention module, which is
an effective network for simultaneously modeling the more profound relation
between windows in a global view and various pixels inside each window as a
local one, and for substantial low-cost boundary adjustment. Specifically, we
treat every window and pixel inside the window as nodes to construct graphs for
both views and devise the Graph Transformer. The introduced boundary-aware
attention module optimizes the edge information of the target objects by
modeling the relationship between the pixel on the object's edge. Extensive
experiments on three widely used semantic segmentation datasets (Cityscapes,
ADE-20k and PASCAL Context) demonstrate that our proposed network, a Graph
Transformer with Boundary-aware Attention, can achieve state-of-the-art
segmentation performance
ADD: An Automatic Desensitization Fisheye Dataset for Autonomous Driving
Autonomous driving systems require many images for analyzing the surrounding
environment. However, there is fewer data protection for private information
among these captured images, such as pedestrian faces or vehicle license
plates, which has become a significant issue. In this paper, in response to the
call for data security laws and regulations and based on the advantages of
large Field of View(FoV) of the fisheye camera, we build the first Autopilot
Desensitization Dataset, called ADD, and formulate the first
deep-learning-based image desensitization framework, to promote the study of
image desensitization in autonomous driving scenarios. The compiled dataset
consists of 650K images, including different face and vehicle license plate
information captured by the surround-view fisheye camera. It covers various
autonomous driving scenarios, including diverse facial characteristics and
license plate colors. Then, we propose an efficient multitask desensitization
network called DesCenterNet as a benchmark on the ADD dataset, which can
perform face and vehicle license plate detection and desensitization tasks.
Based on ADD, we further provide an evaluation criterion for desensitization
performance, and extensive comparison experiments have verified the
effectiveness and superiority of our method on image desensitization
LineMarkNet: Line Landmark Detection for Valet Parking
We aim for accurate and efficient line landmark detection for valet parking,
which is a long-standing yet unsolved problem in autonomous driving. To this
end, we present a deep line landmark detection system where we carefully design
the modules to be lightweight. Specifically, we first empirically design four
general line landmarks including three physical lines and one novel mental
line. The four line landmarks are effective for valet parking. We then develop
a deep network (LineMarkNet) to detect line landmarks from surround-view
cameras where we, via the pre-calibrated homography, fuse context from four
separate cameras into the unified bird-eye-view (BEV) space, specifically we
fuse the surroundview features and BEV features, then employ the multi-task
decoder to detect multiple line landmarks where we apply the center-based
strategy for object detection task, and design our graph transformer to enhance
the vision transformer with hierarchical level graph reasoning for semantic
segmentation task. At last, we further parameterize the detected line landmarks
(e.g., intercept-slope form) whereby a novel filtering backend incorporates
temporal and multi-view consistency to achieve smooth and stable detection.
Moreover, we annotate a large-scale dataset to validate our method.
Experimental results show that our framework achieves the enhanced performance
compared with several line detection methods and validate the multi-task
network's efficiency about the real-time line landmark detection on the
Qualcomm 820A platform while meantime keeps superior accuracy, with our deep
line landmark detection system.Comment: 29 pages, 12 figure
Proximity as a Service for the Use Case of Access Enhancement via Cellular Network-Assisted MobileDevice-to-Device
Device-to-Device (D2D) communication is a way to treat the User Equipments (UEs) not as terminals, but as a part of the network (helpers) for service provisioning. We propose a generic framework, namely Proximity as a Service (PaaS), formulate the helper selection problem, and design and prove a heuristic helper selection policy, ContAct based Proximity (CAP), which increases the service connectivity and continuity. Design Of Experiment (DOE) is a statistical methodology that rigorously designs and conducts an experiment, and maximizes the information obtained from that experiment. We apply DOE to explore the relationship (analytic expression) between four inputs (factors) and four metrics (responses). Since different factors have different regression levels, a unified four level full factorial experiment and cubic multiple regression analysis have been carried out. Multiple regression equations are provided to estimate the different contributions and the interactions between factors. Results show that transmission range and user density are dominant and monotonically increasing, but transmission range should be restricted because of interference and energy-efficiency. After obtaining the explicit close form expressions between factors and responses, optimal values of key factors are derived. A methodology (the e-constraint method) to solve the multiple-objective optimization problem has been provided and a Pareto-Optimal set of factors has been found through iteration. The fluctuation of the iterations is small and a specific solution can be chosen based on the particular scenarios (city center or countryside with different user density). The methodology of optimization informs the design rules of the operator, helping to find the optimal networking solution
Complete Solution for Vehicle Re-ID in Surround-view Camera System
Vehicle re-identification (Re-ID) is a critical component of the autonomous
driving perception system, and research in this area has accelerated in recent
years. However, there is yet no perfect solution to the vehicle
re-identification issue associated with the car's surround-view camera system.
Our analysis identifies two significant issues in the aforementioned scenario:
i) It is difficult to identify the same vehicle in many picture frames due to
the unique construction of the fisheye camera. ii) The appearance of the same
vehicle when seen via the surround vision system's several cameras is rather
different. To overcome these issues, we suggest an integrative vehicle Re-ID
solution method. On the one hand, we provide a technique for determining the
consistency of the tracking box drift with respect to the target. On the other
hand, we combine a Re-ID network based on the attention mechanism with spatial
limitations to increase performance in situations involving multiple cameras.
Finally, our approach combines state-of-the-art accuracy with real-time
performance. We will soon make the source code and annotated fisheye dataset
available.Comment: 11 pages, 10 figures. arXiv admin note: substantial text overlap with
arXiv:2006.1650
Maximum likelihood for high-noise group orbit estimation and single-particle cryo-EM
Motivated by applications to single-particle cryo-electron microscopy
(cryo-EM), we study several problems of function estimation in a low SNR
regime, where samples are observed under random rotations of the function
domain. In a general framework of group orbit estimation with linear
projection, we describe a stratification of the Fisher information eigenvalues
according to a sequence of transcendence degrees in the invariant algebra, and
relate critical points of the log-likelihood landscape to a sequence of
method-of-moments optimization problems. This extends previous results for a
discrete rotation group without projection.
We then compute these transcendence degrees and the forms of these moment
optimization problems for several examples of function estimation under
and rotations, including a simplified model of cryo-EM as introduced by
Bandeira, Blum-Smith, Kileel, Perry, Weed, and Wein. For several of these
examples, we affirmatively resolve numerical conjectures that
-order moments are sufficient to locally identify a generic signal
up to its rotational orbit.
For low-dimensional approximations of the electric potential maps of two
small protein molecules, we empirically verify that the noise-scalings of the
Fisher information eigenvalues conform with these theoretical predictions over
a range of SNR, in a model of rotations without projection
- …